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b) determining whether the^st compound binds to the polypeptide, thereby 
identifying a compound which biiuis to a polypeptide comprising the amino acid sequence of 
SEQ IDNO:20. 



REMARKS 

Claims 1-19 and 22-45 were pending in the application. Claims 1-19 and 22-26 have been 
cancelled, without prejudice, and new claims 46-49 have been added. Accordingly, after the 
amendments presented herein have been entered, claims 27-49 will remain pending. For the 
Examiner's convenience all of the pending claims are set forth herein in Appendix A. 

Support for the newly added claims can be found throughout the specification and claims as 
originally filed. Specifically, support for new claims 46, 47, 48 and 49 can be found in claims 27, 
28, 37 and 38 as originally filed, respectively. Further support for newly added claims 46-49 can be 
found at page 11, lines 10-24, and page 8, lines 35-38 of the specification. 

No new matter has been added. Any cancellation of the claims should in no way be 
construed as an acquiescence to any of the Examiner's rejections and was done solely to expedite the 
prosecution of the application. Applicant reserves the right to pursue the claims as originally filed in 
this or a separate application(s). 

Withdrawal From Consideration of Claims 28, 30, 32-36, 38, 40, and 42-45 As 
Being Directed to a Non-Elected Invention 

The Examiner has withdrawn claims 28, 30, 38, 40, and claims 32-36 and 42-45 in so far as 
they depend from any of claims 28, 30, 38, and 40, as being directed toward an invention that is 
independent or distinct from the invention originally claimed. 

Applicant traverses the withdrawal from consideration of these claims for the following 
reasons. The polypeptide that the Examiner believes is independent from the invention that was 
originally claimed (SEQ ID NO:20) is a splice variant of claimed polypeptide (SEQ ID NO:5). 
Both SEQ ID NO:20 and SEQ ID NO:5 are human capsaicin/vanilloid receptor polypeptides and 
share a high degree of sequence identity. The two polypeptides are identical with the exception of 
164 residues that are deleted in the spice variant (see Figure 17 of the instant application for a 
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sequence alignment) and share the same structural features. Accordingly, the two polypeptides share 
a common utility which is based upon shared structural and functional features and, thus, have a 
unity of invention. Moreover, a search of the full length polypeptide would by default also search 
the splice variant and, thus, no additional searches that would pose undue burden on the Examiner 
would be necessary. Therefore, SEQ ID NO:20 is not directed toward an invention that is distinct 
from the invention originally claimed, and Applicant respectfully requests that the claims directed 
toward SEQ ID NO:20 be considered and examined by the Examiner. 

Rejection of Claims 31-36 and 41-45 Under 35 U.S. C. § 112, First Paragraph 

The Examiner has rejected claims 31-36 and 41-45, under 35 U.S.C. § 1 12, first paragraph 
because, "the specification, while being enabling for the practice of a method of identifying a 
ligand which binds to a receptor protein comprising the amino acid sequence presented in SEQ 
ID NO:5 of the instant specification, does not reasonably provide enablement for the practice of a 
binding assay which employs a protein having anything less than the entire amino acid sequence 
presented in SEQ ID NO:5" (Emphasis added). The Examiner is further of the opinion that 

[t]he instant specification specifically identifies notable structural features of a protein of 
the instant invention. It neither provides the identification of the expendable residues in 
SEQ ID NO: 5 nor even a single working example of a functional protein lacking its 
entire native amino acid sequence. 

Applicant traverses this rejection for the following reasons. As indicated in Applicant's 
previous reply, the teachings in Applicant's specification, and the knowledge generally available in 
the art at the time of filing would allow a skilled artisan to practice the claimed methods using only 
routine experimentation. 

Contrary to the Examiner's assertions, Applicant has taught in the instant specification which 
regions of the VR-2 molecules are important for activity and, thus, which regions of the molecule 
would respond in a binding assay in a manner which is representative of the native protein. 
Specifically, as taught in the specification (at, for example, page 10, lines 1-35; page 66, lines 30-34; 
and page 67, lines 1-1 1), the VR-2 polypeptide contains ankyrin repeats, transmembrane domains, 
and at least one proline rich domain, all characteristic and necessary for the function of the 
Capsaicin/Vanilloid family of receptors. Based on these teachings in Applicant's specification, the 
skilled artisan would be equipped to use a fragment containing one or more of these functional 
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domains to identify compounds that bind to a domain and modulate an activity of the 
Capsaicin/Vanilloid receptor. Compounds identified in such a manner would be of practical value, 
e.g., they could be used as antagonists of a Capsaicin/Vanilloid receptor. For example, as taught in 
Applicant's specification a fragment of a VR-2 polypeptide containing the proline rich domain may 
be used by the skilled artisan to identify compounds capable of binding the proline rich domain and 
disrupting interactions with SH3 domain-containing proteins that act downstream of the VR-2 
receptor in a pain signaling pathway. Such a compound would be expected to inhibit the function of 
the VR-2 polypeptide by disrupting the VR-2 signaling pathway. 

Further, contrary to the Examiner's assertion, Applicant does disclose a working example of 
a fragment of SEQ ID NO:5. SEQ ID NO:20 is a splice variant of SEQ ID NO:5 that is identical in 
sequence to SEQ ID NO:5 with the exception of a 134 residue deletion from residue 529-663 (see 
the sequence alignment presented in Figure 17). Moreover, in Example 5 Applicant discloses 
specific fragments of SEQ ID NO: 5 that were used to generate antibodies against human VR2 (see 
SEQ ID NOs:13, 14 and 15 at page 70 of the specification). 

The Examiner further states that, "Applicant has cited a plurality of patent documents in 
traversal of this rejection. Applicant is advised that none of the patents cited were available at the 
time the instant application was filed." Applicant respectfully submits that, although the patents 
submitted were not issued at the time Applicant filed the instant application, the patents were 
submitted prior to Applicant's filing date and, thus, prove that the methods disclosed therein were 
common and well known to those of ordinary skill in the art at the time of Applicant's filing. 

With respect to the newly added claims 46-49 which are directed to methods of identifying 
compounds that modulate polypeptides that are 95% identical to the hVR2 molecules, Applicant 
would like to point the Examiner's attention to Example 14 Revised Interim Written Description 
Guidelines Training Materials. Example 14 provides that a claim directed to variants of a protein 
having SEQ ID NO:3 "that are at least 95% identical to SEQ ID NO:3 and catalyze the reaction of 
A-^B" with an accompanying specification that discloses a single species falling within the claimed 
genus, satisfies the requirements of 35 U.S.C. § 1 12, first paragraph for written description. The 
rationale behind the foregoing conclusion, as presented by the Written Description Guidelines, is 
that "[t]he single species disclosed is representative of the genus because all members have at least 
95% structural identity with the reference compound and because of the presence of an assay which 
Applicant provided for identifying all of the at least 95% identical variants of SEQ ID NO:3 which 
are capable of the specified catalytic activity." The Guidelines also provide that "ft/he procedures 
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for making variants of SEQ ID NO:3 are conventional in the art and an assay is described which 
will identify other proteins having the claimed catalytic activity. Moreover, procedures for making 
variants of SEQ ID NO:3 which have 95% identity to SEQ ID NO:3 and retain its activity are 
conventional in the art"" 

Similarly, in the present case, the claims are directed to methods which use isolated nucleic 
acid molecules encoding hVR-2 polypeptides comprising an amino acid sequence that is at least 
95% identical to the amino acid sequence shown in SEQ ID NOs:5 and 20, wherein the polypeptide 
is capable of modulating membrane excitability. Furthermore, Applicant has disclosed in the instant 
specification assays for identifying all of the at least 95% identical variants of SEQ ID NO:5 and 20 
which encode polypeptides capable of modulating membrane excitability (see, for example, the 
patch-clamp methods taught by the Applicant in Example 4). Modulation of membrane excitability 
is readily testable by one of skill in the art by the methods described in the instant specification and 
by methods well-known in the relevant art. Accordingly, it would require only routine 
experimentation on the part of one of skill in the art to mutate the VR-2 molecules of the invention 
and test them for the ability to modulate membrane excitability as described in the specification. 

Based on the foregoing teachings in Applicant's specification and the knowledge generally 
available in the art at the time of filing, Applicant respectfully submits that the skilled artisan would 
be able to make and use the claimed invention using only routine experimentation. Accordingly, 
Applicant respectfully requests that the Examiner reconsider and withdraw the foregoing rejection. 

Rejection of Claim 42 Under 35 U.S. C. § 112, First Paragraph 

The Examiner has rejected claim 42 under 35 U.S.C. § 1 12, first paragraph because, "[i]t was 
well know in the art prior to the time of the instant application that the yeast two-hybrid system can 
only employ small soluble proteins as shown in Figure 1 of the Chien et al publication." The 
Examiner is further of the opinion that 

[o]ne of ordinary skill in the art of molecular biology would immediately 
recognize that a yeast two-hybrid system which employed all of the amino acid 
sequence of SEQ ID NO: 5 would be inoperative because the presence of the six 
transmembrane domains within that sequence (Figure IB of Julius et al Pat. 
No. 6,335,180). In so far as this claim encompasses a system employing only a 
soluble portion of the amino acid sequence of SEQ ID NO:5, the instant 
specification does not disclose a practical utility which is to be realized from the 
identification of a protein which bind[s] to a portion of that amino acid 
sequence. 
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Applicant traverses the forgoing rejection for the following reasons. Contrary to the 
Examiner's assertions, employing fragments of SEQ ID NO: 5 in yeast two hybrid assays would be 
useful in finding modulators that disrupt pain signaling mechanisms. One of ordinary skill in the 
art would immediately recognize the potential applications of using fragments of a transmembrane 
protein in screening assays to identify compounds that are capable of binding to an insoluble 
transmembrane protein. Identification of a compound that binds to a fragment of a transmembrane 
protein is useful in the modulation of the activity of the protein. For example, a particularly useful 
application would be the identification of a compound that is capable of binding to an extracellular 
region of the transmembrane protein and either blocks natural ligand binding, or itself modulates 
hVR-2 activity in some other manner. Applicant's specification discloses the presence of proline 
rich domains in hVR-2 polypeptides. Proline rich domains are known to interact with SH3 domains 
in proteins that act upstream or downstream of hVR-2 in a pain signaling pathway and, thus, the 
proline rich domains of VR-2 are useful in screening for modulators that disrupt the pain signaling 
mechanism. Moreover, Applicant's specification provides a hydrophobicity plot for hVR-2 that an 
ordinary skilled artisan would be able to use to identify those regions of the protein that are on the 
cell surface. The use of these portions of the protein would allow for the identification of 
compounds that themselves modulate the activity of the receptor, or through steric effects, abolish 
the ability of the receptor to bind to other ligands. 

Based on the foregoing teachings in Applicant's specification, and the knowledge generally 
available to one skilled in the art at the time of filing, it is obvious that the ordinary skilled artisan 
would realize the applications of using fragments of hVR-2 in a two-hybrid system, and would 
further be able to make and use the claimed invention using only routine experimentation. 
Accordingly, Applicant respectfully requests that the Examiner reconsider and withdraw the 
foregoing rejection. 

Rejection of Claims 27, 29, 31-34, 36, 37, 39 41, and 43-45 under 

35 ILS.C. 102 (e) 

The Examiner has rejected claims 27, 29, 31-34, 36, 37, 39 41, and 43-45, under 35 U.S.C. 
§ 1 02(e) as being anticipated by Julius et al (United States Patent No. 6,335, 1 80 B 1 ). In 
particular, the Examiner is of the opinion that "[t]he amino acid sequence presented in SEQ ID 
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NO: 5 of the instant application is identical to the amino acid sequence presented in SEQ ID NO:36 
of the Julius et al patent." 

Applicant respectfully traverses the aforementioned rejection for the following reasons. As 
evidenced by copies of U.S. Application Serial No: 08/915,461, Provisional Application Serial 
No: 60/072,151and PCT application Serial No. PCT/US98/17466 (submitted herewith as 
Appendices B, C, and D, respectively) to which Julius et al claim priority, SEQ ID NO:36 is not 
disclosed in any of these applications. Accordingly, the 35 U.S.C. § 102(e) date that Julius et al 
are entitled to with respect to SEQ ID NO:36, is January 22, 1999, the filing date of the Julius et 
al patent (6,335,180 Bl). Moreover, Julius et al disclose but do not claim the amino acid 
sequence set forth in SEQ ID NO:36. 

Applicant submits herewith a declaration under 37 CFR §1.131 which indicates that 
Applicant has completed the invention as described and claimed in the above-referenced patent 
application in this country prior to January 22> 1999. Accordingly, Applicant respectfully 
submits that the invention disclosed in the present patent application was reduced to practice by the 
inventor prior to the effective date of the Julius et al reference. As such, the Julius et al reference 
is not available as prior art against the present invention under 35 U.S.C. 102§(e), and Applicant 
respectfully requests that the Examiner reconsider and withdraw this rejection. 

Rejection of Claim 35 under 35 U.S.C. 103 (a) 

The Examiner has rejected claim 35 under 35 U.S.C. § 103(a) as being unpatentable over 
Julius et al Specifically, the Examiner believes 

[bjecause the Julius et al patent disclosed the fact that the receptor described 
therein was naturally expressed in neuronal tissue an artisan would have found it 
prima facie obvious to have expressed that protein recombinantly in a neuronal 
cell line to obtain a more authentic response by that receptor to a test compound. 

Applicant respectfully submits that, in view of the declaration under 37 CFR §1.131 
submitted herewith, the aforementioned rejection is rendered moot. Accordingly, Applicant 
respectfully requests that the Examiner reconsider and withdraw this rejection. 
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Rejection of Claim 42 under 35 U.S.C. 103 (a) 

The Examiner has rejected claim 42 under 35 U.S.C. § 103(a) as being unpatentable over 
Julius et al in view of Chien et al Specifically, the Examiner believes that 

[t]o have incorporated portions of the capsaicin receptor of Julius et al into 
the yeast two hybrid system of Chien et al to identify proteins with might 
interact therewith would have been prima facie obvious to one of ordinary 
skill in the art. 

Applicant respectfully submits that, in view of the declaration under 37 CFR §1.131 
submitted herewith, the Julius et al patent is not available as prior art against the instant application. 
The Chien et al publication, by itself, is not sufficient to render claim 42 obvious to one of ordinary 
skill in the art. Accordingly, Applicant respectfully requests that the Examiner reconsider and 
withdraw this rejection. 



If a telephone conversation with Applicant's attorney would expedite the prosecution of the above- ' 
identified application, the Examiner is urged to call the undersigned at (617) 227-7400. 



SUMMARY 



Respectfully submitted, 



LAHIVE & COCKFIELD, LLP 



Maria Laccotripe Zacharakis, Ph.D. 
Attorney for Applicant 
Limited Recognition Under 37 CFR § 10.9(b) 




28 State Street 
Boston, MA 02109 
Telephone: (617) 227-7400 
Facsimile: (617)742-4214 



Dated: September 25, 2002 
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Appendix A 

27. A method for identifying a compound which binds to a polypeptide comprising the 
amino acid sequence of SEQ ID NO:5, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:5. 

28. A method for identifying a compound which binds to a polypeptide comprising the 
amino acid sequence of SEQ ID NO:20, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:20. 

29. A method for identifying a compound which binds to a polypeptide consisting of the 
amino acid sequence of SEQ ID NO:5, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid sequence 
of SEQ ID NO:5. 

30. A method for identifying a compound which binds to a polypeptide consisting of the 
amino acid sequence of SEQ ID NO:20, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 
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b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid sequence 
ofSEQ ID NO:20. 

31. A method for identifying a compound which binds to a polypeptide comprising at 
least 15 contiguous amino acids of SEQ ID NO:5, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable binding; and 

b) determining whether the test compound binds to said polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising at least 1 5 contiguous 
amino acids of SEQ ID NO:5. 

32. The method of any one of claims 27-3 1 , wherein binding of the test compound to the 
polypeptide is detected by the use of an assay for a hVR-2 activity. 

33. The method of claim 32, wherein said hVR-2 activity is modulation of membrane 
depolarization. 

34. The method of claim 32, wherein said hVR-2 activity is modulation of intracellular 
calcium levels. 

35. The method of any one of claims 27-3 1 , wherein said cell expressing said 
polypeptide is a neuronal cell. 

36. The method of any one of claims 27-3 1 , wherein said compound modulates the 
activity of said polypeptide. 

37. A method for identifying a compound which binds to a polypeptide comprising the 
amino acid sequence of SEQ ID NO:5, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound under 
conditions suitable for binding; and 
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b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:5. 



38. A method for identifying a compound which binds to a polypeptide comprising the 
amino acid sequence of SEQ ID NO:20, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ IDNO:20. 

39. A method for identifying a compound which binds to a polypeptide consisting of the 
amino acid sequence of SEQ ID NO:5, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid sequence 
of SEQ ID NO:5. 



40. A method for identifying a compound which binds to a polypeptide consisting of the 
amino acid sequence of SEQ ID NO:20, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid sequence 
of SEQ IDNO:20. 



41 . A method for identifying a compound which binds to a polypeptide comprising at 
least 15 contiguous amino acids of SEQ ID NO:5, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound under 
conditions suitable for binding; and 
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b) determining whether the test compound binds to said polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising at least 1 5 contiguous 
amino acids of SEQ ID NO:5. 

42. The method of any one of claims 27-3 1 or 37-41, wherein binding of said test 
compound to said polypeptide is detected by the use of a yeast two-hybrid assay. 

43. The method of any one of claims 37-41, wherein binding of said test compound to 
said polypeptide is detected by the use of a direct binding assay. 

44. The method of any one of claims 37-41 , wherein binding of said test compound to 
said polypeptide is detected by the use of a competition binding assay. 

45. The method of any one of claims 37-41, wherein said test compound modulates the 
activity of said polypeptide. 

46. A method for identifying a compound which binds to a polypeptide that is 95% 
identical to the amino acid sequence of SEQ ID NO: 5 and is capable of modulating membrane 
excitability in a cell, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:5. 

47. A method for identifying a compound which binds to a polypeptide that is 95% 
identical to the amino acid sequence of SEQ ID NO: 20 and is capable of modulating membrane 
excitability in a cell, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 
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b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:20. 

48. A method for identifying a compound which binds to a polypeptide that is 95% 
identical to the amino acid sequence of SEQ ID NO: 5 and is capable of modulating membrane 
excitability in a cell, the method comprising: 

a) contacting the polypeptide with a test compound under conditions suitable for 
binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:5. 

49. A method for identifying a compound which binds to a polypeptide that is 95% 
identical to the amino acid sequence of SEQ ID NO:20 and is capable of modulating membrane 
excitability in a cell, the method comprising: 



binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid sequence of 
SEQ ID NO:20. 



a) 



contacting the polypeptide with a test compound under conditions suitable for 
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AGCAGCCGCC 




CTCAGCCTGC 


GGGGCTCCAG 




TCCTCCCGCA 


GCCCCTGCTA 


P TG A G A AGP 


ccc 


AGCAGCCGCC 


apgppptggp 


CTCAGCCTGC 


GGGGCTCCAG 




TCCTCCCGCA 


GCCCCTGCTA 


C TQAG AAGCT 


CCGGGATCCC 


AGCAGCCGCC 


A.CGCCCTQGC 


CTCAGCCTGN 


GMGGCTCCAG 




TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TP AGGCCA AP 


TCCTCCCGCA 


GCCCNTGCTA 


CTGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 


TCCTCCCGCA 


GCCCCTGCTA 


TTGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCTTGG : 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 


TCCTCCCGCA 


GCCCNTGCTA 


TTGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCNTGGC 


CTC AG j CTG : 


GGGGCTCCAG 


TCAGGCCAAC 


TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 


TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 


TCCTCCNGCA 


GCCCTTGNTA 


CTOAGAAGN 


iCC 


AACAGCCGCC 


ACGCCNCGG ; 


CTCAGiCTGi 


GGGGNTCCAG 


TCAGGCCAAC 


CGCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 






TGANAAGC 


CCC 


AGCAGCCNNC 


ACTCCNTGGC 


CNNAACCTGC 


GGGGCTCCAN 


NNN19NCCAAC 






TGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 






TGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 






TGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 






c 


ACGCGTCCGC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 


GGGGCTCCAG 


TCAGGCCAAC 






C 


ACGCGTCCGC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 


GGGGCTCCAG 


TCAGGCCAAC 






C 


ACGCGTCCGC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 


GGGGCTCCAG 


TCAGGCCAAC 








ACGCGTCCGC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 


GGGGCTCCAG 


TCAGGCCAAC 










GCAOCCGC I 


ACGCCCTGGC 


CTCAGCCTGC 


GTGGCTCCAG 


TCAGGCCAAC 










GCAGCCGCC 




CTCAGCCTGT 


GGGGCTCCAG 


TCAGGCCAAC 












TOGC 


CTCAGCNTGC 


GGGGCTCCAG 


TCAGGCCAAC 
















TCCAG 


TCAGGCCAAC 
















AG 


TCAGGCCAAC 


TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


GGGGCTCCAG 


TCAGGCCAAC 


S S R 8 


P C Y 


E A 


P G S C 


, Q P P 


R P G 


L S L R 


G S S 


Q A N 


• + 


• + 


+ +• 


* • • • • • 4 


• ++ • 


• • • • 


++ • • + • 


• + + 


+ + ++ + 
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flhXc21ellz2 4271 
IKflhXc21ellzl #271 
Oenbank N2&„ #270 
Oenbank N29~ #257 
f lhXc21elli2 #231 
4?|flhXc21ellg2 #158 
eihxc21ellgl #140 
^)flhX21ellrl9„ #110 
4f3flhX21ellrl9... #ilO 
flhXc21ellhl #110 
flhX21ellr21v2 #87 
*Hl £ lhXd21ellbl #70 
IP3flhXc21ellil #70 
flhXc21ellbl #70 
i?UflhXc21ellb2 #70 
jthsal03e2b2 #62 
4?5|jthsal03e2bl #62 
$P!jthsal03e2t2 #62 
jPHjthsal03e2tl #61 
4@ Oenbank AA45-. #50 
<f$l jthsal03e02tl„. #50 
^flhX21ellr21a2 #35 
f lhX21ellr21al #16 
Jf|flhX21ellr21g2 #13 
<P) Oenbank H20,- >#i> 
Oenbank H50„ >#l> 
Oenbank H49-. >#1> 
j@ flhX21ellr20... >#i> 
flhX21ellr20.. >#i> 
m £lnX21ellr22„ >#l> 
flhX21ellr22 w >#i> 
f lhX21ollr20-. >#!> 



ACCOACOCOC 
ACCOACOCGC 
ACCGACOCGC 
ACCOACOCOC 
ACCOACOCOC 
ACCGACOCGC 
ACCGACOCGC 
ACCQACGCOC 
ACCGACOCGC 
ACCNACO : GC 
ACCGACGCGC 
ACCOACOCOC 
ACCGACOCGC 
ACCGACOCGC 
ACCGACGCGC 
ACCOACOCGC 
ACCGACGCGC 
ACCGACGCGC 
ACCGACGCGC 
ACCOACOCGC 
ACCOACOCGC 
ACCQACGNGC 
ACCGACGCGC 
ACCGACONGC 



AOCTOGOAOO 

AGCTQOOAGG 

AiNTGGGAGG 

A i NTGGQAGO 

AGCTGOGAGG 

AQCTOOGAGO 

AGCTGOGAGG 

AGCTGOGAGG 

AQCTOOGAGO 

A : NTTGGAON 

AGCTGOGAGG 

AQTTGGGNGG 

AGNTOGGAOG 

AGCTGOGAGG 

AGCTGOGAGG 

AGCTGOGAGG 

AGCTGGGAOG 

AGCTGGGAGG 

AGCTGGGAOG 

A i CTGOGAGG 

AGCTGGGAGG 

AGCT : GGAOG 

AGCT : GGAGO 

AGNTOGGAOG 



A AG AC AGO A C 

AAAACANGAC 

A AO AC AGO AC 

AAGACAOOAC 

AAGACAOOAC 

AAOACAGGAC 

AAGACAGQAC 

AAOACAGGAC 

AAGACAOOAC 

AAOACAGGAC 

AAGACAOOAC 

AUG AC AGO AC 

AANACAGOAC 

AAOACAGGAC 

AAGACAOOAC 

AAGACAGGAC 

AAOACAGGAC 

AAOACAGGAC 

AAGACAGGAC 

AAGACAGGAC 

AAGACAGGAC 

AAGACAGGAC 

AAOACAGGAC 

AAGACAGGAC 



CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTOACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACATC 

CCTTGACNTC 



TNCATNTG C A 
TNCATCTGCA 
TCCATCTGCA 
TCCATCTOCA 
TCCATCTOCA 
TCCATCTGCA 
TCCATCTOCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTOCA 
TCCATCTOCA 
TCCATCTGCA 
TCCATCTGCN 
TCCATCTOCA 
TCCATCTOCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTGCA 



CAAANOTCCT 
CANANGTCCT 
CAGAOGTCCT 
CAGAGOTCCT 
CAQAGOTCCT 
CAGAGOTCCT 
CAGAGOTCCT 
CAGAGOTCCT 
CAGAGOTCCT 
CANAGGTCCT 
CAGAOGTCCT 
CAQAGGTCCT 
CANAGGTCCT 
CAGAGOTCCT 
CAGAGOTCCT 
CAGAGOTCCT 
CAQAGGTCCT 
CAGAGOTCCT 
CAQAGGTCCT 
CAGAOGTCCT 
CAGAGOTCCT 
CAGAGOTCCT 
CAQAGGTCCT 
CAGAGOTCCT 
GAOAGGTCCT 
AGAGGTCCT 
AGAGGTCCT 



OOCTOOACCO 
OOCTOQACCG 
GQCTOOACOA 
GOCTOOACCQ 
OGCTOOACCG 
GGCTGGACCO 
GGCTGOACCO 
OGCTOOACCG 
GGCTGOACCO 
GGCTGGACCO 
GGCTGOACCO 
OGCTOOACCG 
GGCTGGACCG 
GGCTGOACCO 
GGCTGOACCO 
GGCTGGACCO 
GGCTGGACCG 
GGCTGGACCG 
GGCTGGACCG 
GOCTGO : CCG 
GGCTOGACCG 
GGCTGGACCO 
GOCTOOACCG 
GGNTGGACCG 
GGCTGGACCA 
GGCTOGACNC 
GGCTGQACAT 
Q 
G 
O 
G 
G 



AGCAGCCTNC 
AGCAGCCTNC 
NGCAGCCTCC 
AGCAQCCTCC 
AQCAGCCTCC 
AOCAOCCTCC 
AGCAQCCTCC 
AOCAOCCTCC 
AGCAQCCTCC 
AOCAOCCTCC 
AGCAGCCTCC 
AQCAGNCTNC 
AOCAOCCTCC 
AQCAGCCTCC 
AGCAQCCTCC 
AGCAGCCTCC 
AQCAGCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 
AQCAGCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 
AGCAQCCTCC 
NGCAGCCTCC 
: OCAGCCTCC 
iGCAQCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 
AGCAGCCTCC 



TNCT rCTAGG 
TNCT : CTAGG 
TCCTCC Q 
TCCTCC G 
TCCTCCTAGG 
TCCTCCTAOG 
TCCTCCTAGG 
TCCTCCTAOG 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTNCTAGG 
TNCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCC G 
TCCTCC G 
TCCTCC G 
TCCTCC G 
TCCTCC G 
TCCTCC G 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCC G 
TCCTCC G 
TCCTCC G 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAGG 



#271 ACCGACGCGC AGCTGGGAGG AAGACAGGAC CCTTGACATC TCCATCTGCA CAGAGOTCCT GGCTGGACCG AQCAGCCTCC TCCTCCTAGG 
TDAQ LOG R Q. D P.HL HLH RGP GWTE QPP PPR 
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Sequenchsa?* "21ellracefinal" 
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«f| Genbank N28... #360 
«@ Qenbank N29 w #347 
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^flhX21ellrl9-. #200 
4f| £IhXc21ellhl #200 
Jf?|flhX21ellr2L- #177 
<f| £ lhXd21el lbl #160 
flhXc21ellil #160 
4?f)flhXc21ellbl #160 
f lhXc21ellb2 #160 
4@ jthsal03e2b2 #152 
4^)jthsal03e2bl #152 
iEl jthsal03e2t2 #152 
i^jthsal03e2tl #151 
<f|) Genbank AA4-. #140 
4^1 jthsal03e02t.. #140 
m flhX21ellr21... #125 
4f§|flhX21ellr21- #106 
H|flhX21ellr2L.. #103 
Genbank H201-. #41 
Genbank H504 H . #40 
4^1 Genbank H491. M #40 
4^) £lhX21ellr20gl #22 
4SflhX21ellr20g2 #22 
4P)flhX21ellr22a2 #22 
i£HflhX21ellr22al #22 
£lhX21ellr20a2 #22 
£lhx21ellrrl... >#i> 
4K flAX21ellrie~ >#1> 
$?)flhX21ellrl8« >»1> 
i@flhX21ellrl6-. >#1> 
4SflhX21ellria. >#1> 
4^|flhX21ellrl5- >#1> 
flhX21ellrl5«. >#1> 
i@flhX21ellr20„ >#1> 
iRUlhX21ellrl8... >#1> 
|f§)flhX21ellrl7- >#l> 
flhX21ellrl7« >#1> 
@£lhX21ellrl7» >#l> 
flhX21ellrl5- >#1> 
iSDflhX21ellrl5_ >#1> 
\® flhX21ellrl8... >#l> 
j@flhX21ellrl6. H >#l> 
IKlflhX21ellrrl« >#1> 
j£3flhx2lellrl7... >#l> 
H f lhX21ellrl7.- >#!> 
l£lhX21ellrl6 M >#1> 
flhX21ellrl6- >#l> 
Genbank AA7-. >#1> 
£lhX21ellr22„ >#1> 
Genbank AA7« >#i> 
£lhX21ellrl9„ >#l> 
£lhX21ellrl9. R >#1> 
Genbank AIU. >#i> 



ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAA 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 
ATGACCTCAC 



CCTNCAGCTC 
CCTNCAACTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 
CCTCCAGCTC 



TNCAGTTTTT 
TTCAGTTTTT 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTN 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
TCCAGTTTTC 
CCCACGCGTC 
GTTTTC 
GTTTTC 
GTTTTC 
GTTTTC 
GTTTTC 
GTTTTC 
GTTTTC 
GTTTTC 
TC 
TC 
TC 
TC 
C 
C 



i 



AGGTTGGAGA 
AAGNTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGANA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGGGA 
AGGTTGGGGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGGGA 
CGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGGGA 
AGGTTGGAGA 
CGGTTGGAGA 
CGGTTGGAGA 
CGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
AGGTTGGAGA 
GGTTGGAGA 
GGTTGGAGA 
GGTTGGAGA 
GTTGGAGA 
GTTGGAGA 



CANT A 
CATTATATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATNG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTANATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 
CATTAGATGG 



AGGNCCAAAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
A : G : CAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAAAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
AGGCCAAGAA 
A 



NATGGCTCTN 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
AATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
NATGGCTTTN 
GATGGCTTTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTTTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 
GATGGCTCTG 



AGGCGGACAA 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGAC : G 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGAC AN 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 
AGGCGGACAG 



AN G AAA i CTT 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
ANGAAANCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
A : GAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
AGGAAAGCTG 
GAAAGCTG 
AAAGNTG 
AAAGNTG 
AGCTG 



21ella 

Sequeaxihaa5 M "21ellracefinal'' 



jthEal03©2w2 >#1> 

TG 



0361 ATQACCTCAC CCTCCAQCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAG AGGAAAGCTG 
MTSP SSS PVP RLET LDG GQE DGSE ADR GKL 
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f lhxd2 lei lbl #2 50 
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4§) jthsal03e2bl #242 
JP) jthsal03e2t2 #242 
iK jthsal03e2tl #241 
<fH Genbank AM.. #230 
4@ jthsal03e02t™ #230 
flhX21ellr21„ #215 
<P! flhX21ellr2L.. #196 
flhX2lellr21... #193 
<f%} Genbank H20-. #131 
Genbank H50~ #130 
Oenbank H49,„ #130 
j^£lhX21ellr20 M #112 
ill £lhX21ellr20-. #112 
JPI flhX21ellr22... #112 
4KflhX21ellr22- #112 
4?DflhX21ellr20- #112 
4fl flhX21ellrrl3- #71 
4l3flijX21ellrl8v2 #67 
JK)flhX21ellrl8vl #67 
flhX21ellrl6g2 #67 
JH fliX21ellrl8g2 #67 
$£HflhX21ellrl5g2 #67 
M% f lhX21ellrlSgl #67 
4£lflhX21ellr20al #67 
4§ flhX21ellrl8a2 #67 
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GATTTTTGGA 
OATTTTGGGA 
OATTTTGGGA 
OATTTTGGGA 
GATTTTGQGA 
GATTTTGOGA 
OATTTTGGGA 
GATTTTGOGA 
GATTTTGGGA 
GATTTTGGOA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGOGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GA : TTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGGGA 
GATTTTGQGA 
GATTTTGGGA 
GATTTTQGGA 
GATTTTGGGA 
GATTTTGOGA 
GATTTTGGGA 
GATTTTGOGA 
GATTTTGGGA 
GATTTTGGOA 
GATTTTGGOA 
GATTTTGOGA 
GATTTTGGGA 
GATTTTQGGA 
GATTTTGOGA 
OATTTTGGGA 
GATTTTGGOA 
GATTTTGGGA 
GATTTTQGGA 
OATTTTGGGA 
GATTTNQGAA 
GATTTTGQGA 
GATTTTGGGA 
GATTTTNGGA 



GCGGOCTGNC 
GCGGGCTGCC 
GCGGGCTGCN 
OCOGGCTGCC 
GCGGGCTGCC 
GCGOOCTOCC 
GCGGOCTGCC 
GCGQGCTQCC 
OCOGOCTGCC 
GCOGG 

NCGGQCTGCC 
GCGGGCTGCC 
GCGGGCTG : C 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
OCOGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
OCCGGCTG 
GCGGGCTGCC 
GCGGGCTGCC 
OCOGGCTGCC 
QCQGQCTGCC 
OCOGGCTGCC 
NCGGGCTGCC 
GCOQOCTOCC 
GCGGOCTGCC 
OCOQOCTOCC 
GCGGGCTGCC 
OCOGGCTGCC 
GCGGGCTGCC 
GCOGQCTGCC 
OCOGGCTGCC 
GCGGGCTGCC 
GCGGOCTGCC 
OCOGGCTGCC 
OCOGGCTGCC 
GCGGGCTGCC 
GCGGGCTGCC 
GCGGOCTGCC 
GCGGGCTGCC 
OCOGGCTGCC 
QCGGGCTOCC 
GCGGGCTGCC 
NCGGGCTGCC 



TTCNTTGQAO 
TCCCATGGNT 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATOOAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 

TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATOOAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 



TCACAQTT : C ACOGNGANOA CCGGNAATTC CCCCC 



TCACAGTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 

TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTNC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 



AGQOCQAOGA 
AGOGCOAGOA 
AGGGCQAGQA 
AQGQCQAQGA 
AGGGCGAOGA 
AQGGCGAGOA 
AGGGCGAGQA 



CCQOAAATT 

CCQQAAATTC 

CCQQAAATTC 

CCQQAAATTC 

CCGOAAATTC 

CCQOAAATTC 

CCQQAAATTC 



OCCCCTCAOA TAAQAGT 

OCCCCTCAOA TAAQAGTCAA CCTCAACTAC 
OCCCCTCAOA TAAGAQTCAA CCTCAACTAC 
GCCCCTCAOA TAAQAGTCAA CCTCAACTAC 
OCCCCTCAOA TAAGAQTCAA CCTCAACTAC 
GCCCCTCAOA TAAGAQTCAA CCTCAACTAC 



AGGGCGAGGA 
AGGGCGAGQA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGQA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGGA 
AGGGCGAGGA 
AGOGCOAGOA 
AQOOCOAQOA 
AGGGCGAGQA 
AGGGCGAGGA 
AGGGCGAGQA 
AGGGCGAGGA 
AGGGCGAOGA 
AGGGCGAGGA 



CCGOAAATTC 

CCGGAAATTC 

CCQOAAATTC 

CCGOAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGGAAATT 

CCGGAAATT 

CCGGAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGGAAATT 

CCGGAAATT 

CCGGAAATT 

CCGGAAATTC 

CCGGAAATTC 

CCGOAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGGAAATTC 



OCCCCTCAGA 
QCCCCTCAAA 
OCCCCTCAOA 
OCCCCTCAOA 
GCCCCTCAOA 
GCCCCTCAOA 
GCCCCTCAOA 
CTCAGA 
CTCAOA 
GCCCCTCAOA 
GCCCCTCAGA 
GCCCCTCAOA 
CTCAOA 
CTCAOA 
CTCAOA 
OCCCCTCAOA 
GCCCCTCAOA 
GCCCCTCAOA 
OCCCCTCAGA 
GCCCCTCAGA 
GCCCCTCAGA 



TAAGAGTCAC 
TAAGAQTCAA 
TAAGAQTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAGAQTCAA 
TAAGAQTCAA 
TAAQAGTCAA 
T A AG AG T C A A 
TAAQAGTCAA 
TAAGAQTCAA 
TAAGAQTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAGAGTCAA 
TAAGAGTCAA 
TAAGAGTCAA 
TAAGAATCAA 



TC 

CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTCAACTAC 
CCTTAACTAC 



TjCCATOGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATOGAQ 
TCCCATGGAG 
TCCCATOOAG 
TCCCATOGAQ 
TCCCATOGAQ 
TCCCATOOAG 
TCCCATGGAG 
TCCCATGOAO 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATOOAG 
TCCCATGOAO 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGGAG 
TCCCATGOAO 
TCCCATGGAG 
TCCCATQAAO 
TCCCATOAAO 
TCCCATGGAG 
TCCCATGNAG 



TCACAGTTNC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAGTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCACAQTTCC 
TCNCAQTTCA 
TCACAQTTCC 
TCACAQTTCC 
TCACAQTTCC 



AGGGCGAGQA 
AGOGCOAGOA 
AGGGCGAGGA 
AQQGCOAQQA 
AOOGCGAGOA 
AQGOCGAQGA 
AGQOCQAOQA 
AOOGCGAGOA 
AOQOCQAOQA 
AGGGCGAGQA 
AQQGCGAQQA 
AGGGCGAOGA 
AGGGCQAOQA 
AQGQCQAQGA 
AG GQ CO AGO A 
AOOGCGAGOA 
AGGGCGAGGA 
AQGOCOAGGA 
AGGGCGAGQA 
AGGGCGAGGA 
AGGGCGAOGA 
AQOOCOAQOA 
AGQOCQAOGA 
AOQOCQAOQA 
AOOGCOAQOA 
AQOOCOAQOA 



CCGGAAATTC 

CCGOAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGOAAATTC 

CCQOAAATTC 

CCQQAAATTC 

CCQOAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGOAAATTC 

CCQOAAATTC 

CCQQAAATTC 

CCGOAAATTC 

CCGGAAATTC 

CCGGAAATTC 

CCGOAAATTC 

CCQQAAATTC 

CCGGAAATTC 

CCQOAAATT 

CCGGAAATTC 

CCGGAAATT 

CCGOAAATTC 

CCQQAAATTC 

CCGGAAATT 

CNONAAATTC 



GCCCCT 
GCCCCTCAGA 
GCCCCTCAGA 
GCCCCTCAOA 
OCCCCTCAGA 
GCCCCTCAGA 
GCCCCTCAGA 
OCCCCTCAOA 
OCCCCTCAOA 
OCCCCTCAGA 
GCCCCTCAOA 
GCCCCTCAOA 
OCCCCTCAOA 
GCCCCTCAOA 
OCCCCTCAOA 
OCCCCTCAOA 
OCCCCTCAOA 
GCCCCTCAOA 
GCCCCTCAOA 

CTCAGA 
GCCCCTCAOA 

CTCAGA 
GCCCCTCAGA 
GCCCCTCAOA 

CTCAOA 
GCCCCTCAOA 



TAAQAGTCAA 
TAAGAQTCAA 
TAAGAQTCAA 
TAAGAGTCAA 
TAANAOTCAA 
TAAGAGTCAA 
TAAGAGTCAA 
TAAGAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAGAQTCAA 
TAAQAGTCAA 
TAAGAQTCAA 
TAAGAQTCAA 
TAAGAQTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAGAGTCAA 
TAAQAGTCAA 
TAAQAGTCAA 
TAAGAQTCAA 
TAAGAQTCAA 
TAAGAQTCAA 



CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 

CTTCAACTAC 

CCTCAACTAC 

CCTCAACTAC 



21ella 
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4P31jthsal03e2xl 

flhX21ellrlB» 
m eihX21ellr21» 
j§) flhX21ellrl7-. 
$?$flhX21ellr22« 
ahX21ellrrL- 
Genbank AA4... 
*fH f lhX21ellrrl„ 
<Q f lhX21ellrrL.. 
flhX2lellrrl.„ 
Genbank AA8.- 
4§) Genbank AA2- 
IH jthsal03e2wl 



>fll> 
>ffl> 
>#1> 
>#1> 
>fll> 

>n> 
>n> 
>n> 
>«i> 
>#i> 
>#i> 
>u> 
>u> 



TTTGGGA 


GCGGNCTGCN 


TCCCATGNAG 


TCACAGTTCC 


AGGNCGAGGA 


CCGNAAATTC 


GCCCNTCAGA 


TAAGAGTCAC 


CCTCAACTAC 




TGCC 


TCCCATGGAG 


TCACAGTTCC 


AGGGCGAGGA 


CCGGAAATTC 


GCCCCTCAGA 


TAAGAGTCAA 


CCTCAACTAC 






AG 


TCACAGTTCC 


AGGNCGAGGA 


CCGGAAATTC 


GCCCCTCAGA 


TAAGAGTCAA 


fPTPAAPTlP 
\»\» i, V>nilU X illv 






AG 


TCACAGTTCC 


AGGGCGAGGA 


CCGGAAATTC 


GCCCCTCAGA 


TAAGAGTCAA 


CCTCAACTAC 








TCACAGTTCC 


AGGGCGAGGN 


CCGGAAATTC 


GCCCCTCAGA 


TAAGAGTCAA 


CCTCAACTAC 








TCC 


A : GGCGAGGN 


CGGAAATTTC 


NCCCMTCAGA 


TAAAAOTCAC 


CCTCAACTNC 










AGGGCGAGGA 


CCGGAAATT 


CTCAGA 


TAAGAGTCAA 


CCTCAACTAC 










AGGNCGAAGA 


CCGGAAATTC 


GCCCNTCAGA 














GGGCGAGGA 


CCGGAAATTN 


GCCCNTCAGA 


TAAGAGTCAA 


rPTPAAPTlP 










GGC i AGG : 








CCTCAACTNC 










GGA 


CCGGAAATT 
TC 


CTCAGA 
GCCCCTCAGA 


TAAGAGTCAA 
TAAGAGTCAA 


CCTCAACTAC 
CCTCAACTAC 


















TAG 


GATTTTGGGA 


GCQGGCTGCC 


TCCCATGGAG 


TCACAGTTCC 


AGGGCGAGGA 


CCGGAAATTC 


GCCCCTCAGA 


TAAGAGTCAA 


CCTCAACTAC 


D F G S 


G L P 


P M E 


S Q F Q 


G E D 


R K P 


A P Q I 


R V N 


L N Y 


• + • * 


+ • + « + 


• + • • + • 


+ . • 


• +4- « • • 


+ • •+ * + 


• + • 


• • • 


• • • + • 



21ella 

Sequencher™ "21ellracefinal 



40 f lhXc21ellg2 #428 
4fl<)flhXc21ellgl #410 
flhX21ellrl9«. #380 
4fH € lhX21ellrl9.- #380 
4£?|flhXc21ellhl «380 
flhXc21ellbl #340 
Jf§|flnXc21ellb2 #340 
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Genbank H20„ #221 
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K§)flhX21ellr20.,. #157 
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m flhX21ellrl5.., #152 
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4@ Genbank AA7« #122 
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flhX21ellrl9a2 #98 
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m jthsal03e2w2 #93 
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flhX21ellr22g2 #61 
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Genbank AA46-. #51 
flhX21eUrrl3~ #51 
^flh^lollrm.. #50 



CQAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGA 
CGAAAGGGA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAA : GGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CQAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CNAAAGGGAA 
CGAAA i GGAA 
CGAAAGGGAA 
CGAAAAGGAA 
NGAAAGGGAA 



CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 



TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 



CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 



TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTNCCCGAGA 
TTGACCGAGA 



TCGQCTCTNC 
TCGGCTCNNC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTN 
TCGGCTNTTT 



AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
ATTGCGGGCT 
AATGCGGGCT 



CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCGGGGGTGT 
CCCGGGGTGT 



CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCNAAGQAT 
CCCGAAGGAT 



CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGATGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTCCCAG 
CAGGTGCCAG 
CAGGTCCCAG 
CAGGTGCCAG 



TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
NCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCGGGAT 
TCAGCCGGAT 
TCACCCGGNT 
TCA : NCGQAT 



CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGA I 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CNAAACCGAT 
CC : AANCGAT 



TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCCAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCNAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TNGACCGAGA 
TTGACCGAGA 
TTGACNGAGA 
TTGACCGAGA 



TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTT 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGGTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
A 

TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGNTTTTC 
TCGGCTCTTC 
TCGG : TTTTC 
TCGGCTCTTC 



AATGCGGTCT CCCGGGGTGT : CCCGAGGAT 
AATGCGGTCT CCCGGGGTGT CCCCGAGGAT 



AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCNGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 



CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
NCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 



CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCNAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
(CCCGAGGAT 
CCCCGAGNAT 
CCCCGANNAT 
CCCTGAGGAT 
CCCTGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCTGAGGAT 
CCCTGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 



AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATCGGTTCT 
AATGCGGTCT 
AATCGGOTCT 
AATCGGTTCT 



CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCGGGGGTGT 
CCCGGGGTGT 
CCNOGGGTGT 
CCGGGGGTGT 



CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCTAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGNAT 

CCCCGAGGAT 

CCCCGANGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCCGAGGAT 

CCCNGAGGAT 



21ella 

Sequencher™ w 21ellraoefinal" 



eihX21ellrrl4-. #49 
Genbank AA81- #44 
Oenbank AA23* #33 
4fHjthsal03e2wl #4 
f lhX21ollrr9« >#1> 
jthsal03e2x2 >#1> 
Oenbank AIO.., >n> 



CGAAAGGGAA CAGGTGCCA : TCAACCGGAT CCAAACCGAT 
CGAAAGGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCGAT 
CGAAAGG : AA CAGGTGCCAG TCAGCCGGAT CCAAACCGAT 
NGAAAGGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCGAT 
AAAAGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCGAT 
CAG TCAGCCGGAT CCAAACCGAT 



TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 



T : GGGTTTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TGGGCTCTTC 
TCGGCTCTTC 



AATCGGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATCGGTTCT 
AATGCGGTCT 
CT 



CCCGGGGTGT 
CCC i GGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCNGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 



CCCCQAGGAT 
CCCCGAGGAT 
CCC : GAGGAT 
CCCCQAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 



&541 CGAAAGGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCGAT TTGACCGAGA TCGGCTCTTC AATGCGGTCT CCCGGGGTGT CCCCGAGGAT 
RKGT GAS QPD PNRP DRD RLP NAVS RGV PED 



21ella 

Sequencbat™ u 21ellracefinal" 



$P|flnXc21ellg2 #518 
flhXc21ellgl #500 
4PHflhX21ellrl9« #470 
if§flhX21ellrl9_. #470 
m flhXc21ellhl #470 
m flhXc21ellbl #430 
|^flhXc21ellb2 #430 
J@ j thsal03e2 12 #422 
JS) jthsal03e2tl #421 
jthsal03e02t.. #410 
flhX21ellr21~ #395 
Jf|flJiX21ellr21... #376 
$£DflhX21ellr21~ #373 
Genbank H20™ #311 
<f| Oenbank HS0.„ #310 
4@ Oenbank H49.. #310 
flhX21ellr20~ #292 
flhX21ellr20... #292 
flhX21ellr22« #292 
f lhX21ellr22„ #292 
flhX21ellr20- #292 
4KflhX21ellrrl.. #251 
m flhX21ellrl6« #247 
III f lhX21ellrl8-. #247 
flhX21ellrl5- #247 
j^tflhX21ellrl5«. #247 
flhX21ellr20.. #247 
4^flhX31ellrl8-. #247 
flhX21ellrl7,- #243 
flhX21ellrl7» #243 
^| f lhX21ellrl7 H . #243 
flhX21ellrl5^ #243 
4SflhX21ellrl5« #242 
4?HflhX21eUrl8.- #242 
j^flhX2lQllrl6» #241 
4@£lhX21ellrl7. M #240 
4®flhX21ellrl7« #240 
4?|flhX21ellrl6~ #239 
eihX21ellrl6.„ #239 
4© Genbank AA7. H #212 
<K|flhX21ellr22~. #201 
Genbank AA7... #189 
flhX21ellrl9._ #188 
4@flhX21ellrl9- #188 
*f§) Genbank All.. #186 
jthsa!03e2w2 #183 
jthsal03e2xl #176 
S§ £lhX21ellrl8... #165 
m £lhX21ellr21.. #153 
flhX21ellrl7. H #153 
flhX21ellr22- #151 
4@flhX21ellrrl w #144 
Genbank AM. #141 
£lhX21ellrrl. #141 
flhX21ellrrl- #140 
4@£lhX21ellrrl 4 .. #139 
Genbank AA8,„ #134 
i&l Genbank AA2« #123 
l@ jthsal03o2wl #94 



CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGNTTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAT 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGNAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGNAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 
CTGGCTGGAC 



TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TT 

TTTCAGAGTA 
TTNCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TT 

TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTTCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCANAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 
TTCCAGAGTA 



CCTGAGCAAG ACCAGCAAGT 
CCTGAGCAAG ACCAGCAAGT 
CCTGAGCAAG ACCAGCAAGT 
CCTGAGCAAG ACCAGCAAGT 
CCTGAGCAAG ACCAGCAAGT 



CCTGAGCCAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 



ACCAGCAAGT 
ANCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 



ACCTCACCGA CTCGGAATAC ACAGAGGGNT CNA 
ACCTCACCGA CTCGGAATAC ACAGAGGGNT CCA 
ACCTCACCGN CTCGGAAT : C NCAGAGGG 
ACCTCACCGA CTCGGAATAC ACAGAGGGG 
ACCTCACCGA CTCGGAATAC ACAGAGGGCT CCACAGGTAA 

ACCTCrCCGA CTCGGAATAC ACAGAAGGCT TCACAAGTAA 



ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTiACCGA 
AC 

ACCTCACCGA 
ACCTCMCCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 



CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTTGGAATAC 

CTTGGAATAC 

CTCGGAA 

CTCGGAAT 

CTCGGAATAC 

CTCGGAATAC 

CTCGGAATAC 



ACAGANGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGC 
ACAGAGGGCT 



NCACANGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 



CCACAGGTAA 
ACAQAGGGGT CCACAGGTAA 



GACGTGCCTG 

AACGTGCCTG 

GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 

GACGTGCCTG 

GACGTGCCTG 



ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 



CCACAGGTAA GACGTGCCTG 
CCACAGGTAA GACGTGCCTG 
CCACAGGTAA GACGTGCCTG 



CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 
CCTGAGCAAG 



ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAA 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 
ACCAGCAAGT 



ACCTCACCGA CTCGGAATAC AC AG A 
ACCTCACCGA CTCGGAATAC AC AG A 
ACCTCACCGA CTCGGAATAC ACAGA 
ACCTCACCGA CTCGGAATAC NCAGAG 
ACCTCACCGA CTCGGAATAC ACAGAGGGCT 
ACCTCACCGA CTCGGAATAC ACAGAGGGCT 



CCACAGGTAA GACGTGCCTG 
CCACAGGTAA GACGTGCCTG 



ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCT:NCCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCNCCGN 
NCCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
NCCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 
ACCTCACCGA 



CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCG 

CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 
CTCGGAATAC 



ACAGA 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGANGGCT 

ACAGAGGGCT 

NCNGA 

ACAGAGGGCT 

ACAGAGGGCT 

NCAGA 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGGGCT 

ACAGAGG 

AC AG 

ACAGA 

ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 
ACAGAGGGCT 



CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
TCACANGTNA 
CCACAGGTAA 



GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
AACGTGCCTT 
GACGTGCCTG 



CCACAGGTAA GACGTGCCTG 
CCACAGGTAA GACGTGCCTG 



CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 



GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 

GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 



CCACAGGCAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 
CCACAGGTAA 



GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 
GACGTGCCTG 



21ella 

Sequencher™ u 2 lellxacef ixial " 



<|UlhX21ellrr9a- #89 CTGOCTGGAC TTCCAGAGTA CCTGAGCAAG ACCAGCAAGT ACCTCACCGA CTCGGAATAC ACAGAGGGCT CCACAGGCAA GACGTGCCTG 

<P*jthsal03e2x2 #74 CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG ACCAGCAAGT ACCTCACCGA CTCGGAATAC ACAGAGGGCT CCACAGGTAA GACGTGCCTG 

jg Genbank AIOO- #23 CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG ACCAGCAAGT ACCTCACCGA CTCGGAATAC ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
JeJ jthsal03e2al >#l> 

xgj, CTCGGAATAC ACAGAGGGCT CCACAGGTAA GACGTGCCTG 

463 jthsal03e2a2 >#1> ~ m ^„** m »« 

CTCGGAATAC ACAGAGGGCT CCACAGGTAA GACGTGCCTG 



#631 



flhXc21ellhl #560 
$ID £lhXc21ellb2 4520 
jthsal03e2tl #511 
jthsal03e02t... #500 
^flhX21ellr21 m #465 
4@flnX21ellr21-. #466 
4^1 Genbank H20-. #401 
4^1 Genbank H49.„ #400 
4SflhX21ellr22« #382 
j@£lhX21ellr22 H . #382 
flhX21ellr20.. #382 
flhX21ellr20., #337 
Jf?)flhX21ellrl8.- #337 
4@flnX21ellrl7„ #333 
f lhX21ellrl5.. #333 
4KflhX21ellrl5. B #332 
m flhX21elirl8, u #332 
$f||flhX21ellrl6.. #331 
IHflhX2lellrl7- #330 
4^1 f lhX21ellrl7... #330 
4KcihX21ellrl6.. #329 
i@ Genbank AA7„ #302 
<S) Genbank AA7- #279 
flhX21ellrl9„. #278 
4^flaX21ellrl9„. #278 
Genbank AI1„ #276 
jthsal03e2w2 #273 
jthsal03e2xl #268 
<PDflhX21ellrrl... #234 
4© Genbank AA4~ #231 
4K)flhX21ellrrU. #231 
flhX21ellrrl M . #230 
^£lhX21ellrrl.„ #229 
4§j) Genbank AA8~ #224 
40 Genbank AA2- #213 
m Jthsal03e2wl #184 
4©flhX21ellrr9.„ #179 
m jthaal03e2x2 #164 
Genbank AI0« #113 
jthsal03e2al #41 
jthsal03e2a2 #41 
4@ f lhX21ellrrl... >#1> 



CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG ACCAGCAAGT ACCTCACCGA CTCGGAATAC ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
LAGLPEYLSKTSKYLTDSE 



TEG 
+ + • • 



S T G 
• + • • 



AT : AAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAA 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAANGCTT 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 



TGCT 
TGCT 

TGCTGAACCT TAAGGACNGA GTCAATGCCT GCATTNTGC 

TGCTGAACCT TAAGGACNGA GTCAATGCCT GCATTCTGCC 

TGCTGAAC 

TGCTGAACC 

TG 



ACTGCTGCAG ATCGACAGGG ACTNTGQCAA TCCTNAOCCC 



TGCTGAACC 
TGCTGAACC 
TGCTGAACC 
TGCTGAACC 
AGCTGAACC 
TGCTGAAC 
TGCTGAACC 
TGCTGAACC 
AGCTGAACC 
TGCTGAAC 
TGCTGAAC 
TGCTGAAC 
TGCTGAACC 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 



TAAGGACGGA 

TAAGGACGGG 

TAAGGACGGG 

TAAGGACGGG 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGG 

TAAGGACGGG 

TAAGGACGGG 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGA 

TAAGGACGGG 

TAAGGACGGA 

TAAGGACGGA 



GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCNT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 



GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGTC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGTC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 



ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 

ACTGCTGCAG 



ATCGACAGGG 
ATCGACAGGG 
ATCGACCGGG 
ATCGACCGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACCGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACAGGG 
ATCGACCGGG 
ATCGACAGGG 
ATCGACAGGG 
AGGG 



ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTCTGGCAA 

ACTNTGGCAN 



TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCCTCAGCCC 

TCNTCAGCCC 



•721 ATGAAGGCTG TGCTGAACCT TAAGGACGGA GTCAATGCCT GCATTCTGCC ACTGCTGCAG ATCGACAGGG ACTCTGGCAA TCCTCAGCCC 
MKAV L " L KDG VNAC ILP LLQ IDRD SON PQP 



21ella 

Seqvtencher™ u 2 lellracef inal " 



40jthsalO3eO2t-. «590 
Oenbank AA7„. #392 
40 Oenbank AA7. H #369 
flhX21ellrl9-. #368 
m flhX21ellrl9... #368 
40 Oenbank AIL... #366 
40 jthsal03e2w2 #363 
40jthsalO3e2xl #358 
40£lhX21ellrrl w #324 
40 Oenbank AA4~ #321 
40flhX2lellrrl... #321 
40£lhX21ellrrl™ #320 
40 f lhX21ellrrl... #319 
40 Oenbank AA8.~ #314 
40 Oenbank AA2... #303 
40 jthsal03e2wl #274 
40flhX21ellrr9«. #269 
i0 jthsal03e2x2 #254 
40 Genbank AI0... #203 
10 jthsa!03e2al #131 
40jthsalO3e2a2 #131 
40flhX21ellrrl4-. #25 



CTOQTAAATG 

CTQGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTOGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTQGTAAATQ 

CTGGTAAATG 

CTGQTAAATG 

CTGGTAAATG 

CTGGTAAATG 

CTQGTAAATG 

CTGGTAAATG 

CTGGTAAATG 



CCCAOTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 

CCCAGTGCAC 



AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 

AGATGACTAT 



TACCGAGGNC 

TACCQAOOCC 

TACCQAGGCC 

TACCQAOOCC 

TACCQAQQCC 

TACCQAOOCC 

TACCQAOOCC 

TACCQAQQCC 

TACCQAOOCC 

TACCGAGGCC 

TACCGAGOCC 

TNCCGAGGCC 

TACCGAGGCC 

TACCGAGOCC 

TACCOAQQCC 

TACCGAGGCC 

TACCGAGGCC 

TACCGAGGCC 

TACCGAGGCC 

TACCGAGGCC 

TACCQAQQCC 

TACCGAGGCC 



ACAQCGC 

ACAQCOCTCT 

ACAQCO : TCT 

ACAQCOCTCT 

ACAOCGCTCT 

ACAQCOCTCT 

ACAOCGCTCT 

ACAQCOCTCT 

ACAQCOCTCT 

ACAGCGCTCT 

ACAOCGCTCT 

ACAQCOCTCT 

ACAOCGCTCT 

ACAOCGCTCT 

ACAGCGCTCT 

ACAOCGCTCT 

ACAOCGCTCT 

ACAGCGCTCT 

ACAGCGCTCT 

ACAGCGCTCT 

ACAGCGCTCT 

ACAOCGCTCT 



GCACATCOCC 

OCACATCOCC 

GCACATCGCC 

GCACATCGCC 

GCACATCOCC 

GCACATCOCC 

GCACATCOCC 

GCACATCQTC 

GCACATCOCC 

GCACATCGCC 

OCACATCOCC 

GCACATCGCC 

OCACATCOCC 

GCACATCGCC 

GCACATCGCC 

GCACATCQTC 

GCACATCGCC 

GCACATCOCC 

GCACATCGCC 

GCACATCGCC 

GCACATCGCC 



ATT 
ATT 

ATTGAOAAOA 
ATT O AO AAOA 
ATTGAOAAOA 
ATTQAQAAQA 
AT TO AO AAOA 
ATTGAOAAOA 
ATT 

ATTGAOAAOA 
ATTQAQAAQA 
ATTGAOAAOA 
ATT 

ATTGAOAAOA 
ATTGAOAAOA 
ATTOAGAAGA 
ATTGAGAAGA 
ATT 

ATTGAGAAGA 
ATTGAGAAGA 
ATTGAOAAOA 



OQAOTCTQCA 
QQAOTCTGCA 
OQAOTCTQCA 
QQAOTCTGCA 
GOAOTCTQCA 
OGAGTCTGCA 



OTQTQTQAAG 
GTGTGTGAAG 
GTGTQTGAAQ 
GTGTGTGAAG 
GTGTGTGAAG 
GTGTGCGAAG 



OGAGTCTGCA QNQTQTNAAG 
OGAGTCTGCA GTGTGTGAAG 
QQAOTCTGCA GTGTGTGAAG 

GGAQTCTOCA GTGTGTGAAG 
QQAOTCTGCA OTQTGTGAAG 
QQAOTCTGCA OTGTQCGAAG 
OGAGTCTGCA GTGTOTQAAG 

OGAGTCTGCA GTOTGTGAAG 
GGAQTCTOCA GTGTGTGAAG 
GGAGTCTOCA GTGTGTGAAG 



•811 CTQGTAAATO CCCAOTGCAC AGATGACTAT TACCGAGGCC ACAQCOCTCT OCACATCOCC ATTQAQAAQA OQAOTCTQCA QTOTOTOAAQ 
L V N A OCT DD Y VROH SAL H I A IBKR SLQ CVK 



21ella 

Sequencher™ ^lellracefinal" 



^flhX21ellrl9... 
ffli flhX21ellrl9.- 
4^1 Genbank All.. 
$Hjthsal03e2w2 
$0jthsalO3e2xl 

flhX21ollrrl™. 

flhX21ellrrl« 
<fH ClhX21ellrrl.- 
4®l flhX21ellrrl. B 
Genbank AA2... 

jthsal03e2wl 

f lhX21ellrr9„. 
4P?|jthsal03e2x2 
J£!;jthsal03e2al 

jthsal03e2a2 
*fD f lbX21ellrrl„. 
«p| frhobl2c4gl.... 
40 f rhob012c04s... 
iffi f rhob012c04s... 
4H f lhX21ellrha2 

flhX21ellrnrv2 
j£| flhX21ellrlvl 

frtiocl2c4h3.... 
4H f lhX21oHrlv2 
40 f lhX21ellnnvl 
4£iflhX21ellrja2 
4® Genbank AA9« 
4@ Genbank AA9... 



#458 
#458 
#456 
#453 
#448 
#414 
#411 
#410 
#409 
#393 
#364 
#359 
#344 
#221 
#221 
#115 
>#1> 
>#1> 
>#!> 
>#1> 
>fll> 

>#i> 
>n> 
>#i> 
>n> 
>«i> 
>#i> 
>«i> 



CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 
CTCCTGGTGG 



AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 
AGAATGGGGC 



CAATGTGCAT 
CAATGTGCAT 
CAATG 

CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGNAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 
CAATGTGCAT 



GCCCGGGCCT GCGGCCGC 
GCCCGGGCCT GCGGCCGC 



GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 
GCCCGGGCCT 



GCGGCCGCT 

GCGGCCGCT 

GCGACCGCTT 

GCGGCCGCTT 

GCGGCCGCTT 

GCGGCCGCTT 

G 

GCGGCCGCT 
GCGACCGCTT 
GCGGCCGCT 
GCGGCCGCT 
GCGGCCGCT 
GCGGCCGCTT 
GCGGCCGCTT 
GCGGCCGCTT 
GCGGCCGCTT 
CGGCCGCTT 
GCCGCTT 
CCGCTT 
TT 



CTTCCAGAAG GGCCAAGGGA CTTGCTTTTA TTTCGGTGAG 
NTTCCAGAAG GGCCAAGGGN CTTGCTTTTA TTTNGGTNAG 
CTTCCAGAAG GGCCAAGGGA CTTGCTTTTA TTTCGGTGAG 
CTTCCAGAAG GGCCAAGGGA CTTGCTTTTA TTTCGGTGAG 



CTTCCAGAAG GGCCAAGGGA CTTGCTTTTA TTTCGGTGAG 



CTTCCAGAAG 
CTTCCAGAAG 
CTTCCAGAAG 
CTTCCAGAAG 
CTTCCAGAAG 
CTTCCAGAAG 
CTTCCAGAAG 
CTTCCAGAAG 
TTCCAGAAG 
AGAAG 



GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCC : AGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
AGGGA 



CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 
CTTGCTTTTA 



TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGG : HAG 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAG 
TCGGTGAG 
TCGGTGAG 



#901 CTCCTGGTGG AGAATGGGGC CAATGTGCAT GCCCGGGCCT GCGGCCGCTT CTTCCAGAAG GGCCAAGGGA CTTGCTTTTA TTTCGGTGAG 
LLVE NGA NVH ARAC GRP PQK GQGT CFY FGE 

+ • + t + + •+ 



v 



21ella 

Sequencher™ tt 21ellraoefinal" 





C 1 ALLLL 11.1 


v>i i muiiCuc 


TTGCJVPC&AQ 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


iP<l r 1x1X2 lei im.^ fsoi 




^ii mu^^uc 


1 X ^UAV*\.AAsJ 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCNCCAG 


CCTGCAGGCC 


4F<J flcxziaiim.. bsqo 


L 1ALLLL Itl 




1 lULAV-VAAU 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 




lr J A^U^.^. 1 \» I 




TTGC AC C A AG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


GGCCCGCCAG 


CCTGCAGGCC 


401 *1 ViV>1 d1 1 rrO ttil AQ 




CTTTGGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


f 1 hX3 1 All rrl ft 5 0 5 


PTAC CC CT CT 


CTTTGNCCGC 


TTGCACCAAQ 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


GGCCCGCCAG 


CCTGCAGGCC 


^K^l fr-hobiaeAal a~ #51 


CNACCCCTCT 


CTTTGNCCGC 


TTGCACCAAG 


CAGTGGNATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACNCC 


AGCCCGCCAG 


CCTGCAGGCC 


j£j)frhah012c04sl_ #51 


CTACCCCTCT 


CTTTGGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


401 frfrol->012e04gl- #51 


CTACCCCTCT 


CTTTNGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


f 1 hXOI «1 1 t+ia2 flRfl 




CTTTGGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


4e5I FIViVOl oi 1 rrmrO fldfl 
£ XiiAuS 161. J.XIUVZ ft <l O 




^ X X A \J \J \» \* \J \^ 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


lesl i-ixiA-6i.©iir±vi ff« / 






TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


1p3 trnoclzcanj . a«. 943 




pwnfinppflp 
L 1 1 luULLyL 




C ARTGGGATG 


TGGTAAGCTA 


CCTCCT : GAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


Ipg cinX2lellrlv2 940 


CTACCCCTCT 






partTflflflATfl 
\>nu i uvjvvn x u 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


tP<J tlhX2lellnnvl 93d 


CTACCCCTCT 


(.11 1vjVjL^.vC 


n"r rar 1 app a a rs 


riflTRflflATfl 
\— n\9 X U uU X W 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


4§£lhX21ellrja2 #26 


CTACCCCTCT 


CTTTGGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


^ Genbank AA992.. #9 


CTACCCCTCT 


CTTTGGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


^f-<j GenJDanK AAyy d... ffa 


C A AC LH.1L1 


L 1 1 i \*\aV>V>\Jv> 


TTflrJlPPAAO 
x x sj w #^ *^ V— xVrt 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 




CCCCTCT 


CTTTGGCCGC 


TT GC ACC AAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


4fl f lhX21ellrr9_. >#1> 


CCCCTCT 


CTTTGGCCGC 


TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


^jthKbOSlaOlt,. >#1> 




CTTTGGCCGC 


TTGCNCCAAG 


CAGNGGGATG 


TGGTAAGCTA 


CCTMCTGGAG 


AACCCACACC 


AGNCCGCCAG 


CCTGCAGGCC 


4^ flhX21ellrrl». >#1> 










TA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


^Hjj f lhX21ellrrl». >#1> 










TA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 


0991 


CTACCCCTCT 


CTTTGGCCGC 




TTGCACCAAG 


CAGTGGGATG 


TGGTAAGCTA 


CCTCCTGGAG 


AACCCACACC 


AGCCCGCCAG 


CCTGCAGGCC 




L P L S L A A 


C T K 


Q W D V V S Y 


LLC 


N P H Q PAS 


L Q A 




+ 






+ + 




+. • 


+ 


• + + 






ACTGACTCCC 


AGGGCAACAC 
















4K2i 4T1 UVOU1 1 v-r-1 ACOI 


ACTQACTCCC 


AGGGCAACAC 


NGT 














xllUlfiX6Xxxx7.X« 


ACTGACTCCC 


AGGGCAACAC 


AGT 














4K?1 liMI a1 1 rr*1 i^ftO 


ACTGACTCCC 


AGGGCAACAC 


AGTCC 














JK31 FlViVQI oi 1 *-»-Q ftRIQ 
Tr"VI tJlUUieXXHJ-. 


ACTGACTCCC 


AGGGCAACAC 


AGTC 














Iimilolllti-. ff«33 


ACTGACTCCC 


AGGGCAACAC 


AGTC 
















ANTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATGTCGGN 


CAANTCAGCT 


GAGAACATTG 


CACTGGTGAC 


CAGCATGTAT 


JK?i frhnhAI 2s*AAe ftl dl 


ACTGACTCCC 


AGGGCAACAC 


AQTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACATTG 


CACTGGTGAC 


CAGCATGTAT 


^r<| iroouviACviiBw VJ.4J. 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACATTG 


CACTGGTGAC 


CAGCATGTAT 


Je31 ^1 SV?1 a1 1 v+ibO Hi An 

1P<l £x£uuxej>j.irnaz wiqu 


ACTGACTCCC 


AGGGCAACAC 




GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






^p^| £1uAa JoJIIUIVa ffiJo 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






f lhX3lAllrlvl 11117 


ACTGACTCCC 


AGGGCAACAC 


AGTC CTGC AT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






4Kl £ rhoel 3 c4h3 #133 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACATTG 


CACTGGTGAC 


CAGCATGTAT 


f lhX21ellrlv2 #130 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






^K)flbX21ellrnwl #126 


ACTGACTCCC 


AGGGCAACAC 


AQTCCTGCAT 


GCCCTAGTGA 


TGATGTCGGA 


CAACTCAGCT 


GAGAAC 






{@flhX21ellrja2 #116 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






Oenbank AA99~ «99 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACATTG 


CACTGGTGAC 


CAGCATGTAT 


Genbank AA99- #99 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACATTG 


CACTGGTGAC 


CAGCATGTAT 


£ lhX21allrr9al «88 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






4f2UlWQlellrr9a2 #88 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAACA 






^P) jthKbOSlaOltl- #81 


ACTGACTCCC 


AGGGCAACAC 


AGTCCTGCAT 


GCCCTAGTGA 


TGATCTCGGA 


CAACTCAGCT 


GAGAA 






elhX21ellrrlU. #43 


ACTGACTCCC 


AGGGCAACAC 


AGTC 














4§) flhX21ellrrll„ #43 


ACTGACTCCC 


AGGGCAACAC 


AGTN 















#1081 



ACTGACTCCC AGGGCAACAC AGTCCTGCAT GCCCTAGTGA TGATCTCGGA CAACTCAGCT GAGAACATTG CACTGGTGAC CAGCATGTAT 
TDSQ GNT VLH ALVM ISD NSA ENIA LVT SMY 



frhobl2c4gl.~ (1231 
fri»6b012c04s- (1231 
4§| frtobb012c04s_ #231 
4§ f rhocl2c4h3 .... #223 
Genbank AA9... #189 
Genbank AA9-. 0189 
jthLa029clOt™ >#1> 
<f| Genbank W38- >*1> 



21ella 

Sequencher^ u 21ellracef inal " 



GATGGGCTCC 
GATGGGCTCC 
GATQGGCTCC 
GATGGGCTCC 
GATGGGCTCC 
GATGGGCTCC 



TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 



GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
CGCCTC 
CCTC 



TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 



TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 



GGACATCCGC 
GGACATCCGC 
GGACATCCGC 
GGACATCCGC 
GGACATCCGC 
GGACATCCGC 
GGACATCCGC 
GGACATCCGC 



AACCTNCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 



ATCTCACGCC 

ATCTCACGCC 

ATCTCACGCC 

ATCTCACGCC 

ATCTCACG 

ATCTCACGCC 

ATCTCACGCC 

ATCTCACGCC 



TCTGAAGCTG 
TCTGAAGCTG 
TCTGAAGCTG 
TCTGAAGCTG 

TCTGAAGCTG 
TCTGAAGCTG 
TCTGAAANTG 



#1171 GATGGGCTCC TCCAAGCTGG GGCCCGCCTC TGCCCTACCG TGCAGCTTGA GGACATCCGC AACCTGCAGG ATCTCACGCC TCTGAAGCTG 
DGLL QAG ARL CPTV QLE D I R NLQD LTP LKL 



<fF3fr*obl2c4gl.... #321 
4$% frhobO12c04fl„. #321 
4§frtt6bO12c04s.. #321 

frhocl2c4h3.... #313 
Genbank AA9~ #279 

jthLa029clOtl« #67 
4@ Genbank W386- #65 
*@ Genbank AA3... >#1> 

#1261 



GCCGCCAAGG 
GCCGCCAAGG 
GCCGCCAAGG 
GCCGCCAAGG 
GCCGCCAAGG 
GCCGCCAAGG 
GCCGCCAAGG 



AGGGCAAGNT 
AGGGCAAGAT 
AGGGCAAGAT 
AGGGCAAGAT 
AGGGCAAGAT 
AGGGCAAGAT 
AGGGCAAGAT 



CGANATTTTC 
CGAGATTTTC 
CGAGATTTTC 
CGAGATTTTC 
CGAGATCTTC 
CGAGATTTTC 
CGAGATTTTC 



AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 



TOCA 

TGCAGCGGGA GTTTTCAGGA CTGAGCCACC 
TGCAGCGGGA GTTTTCAGGA CTGAGCCACC 
TGCAGCGGGA GTTTTCAGGA CTGAGCCACC 
T 

TGCAGCGGGA GTTTTCAGGA CTGAGCCACC 
TGCAGCGGGA GTTNTCAGGA CTGAGCCACC 
TTCAGGA CTGAGCCACC 



TTTCCCGAAA GTTCACCGAG 
TTTCCCGAAA GTTCACCGAG 
TTTCCCGAAA GTTCACCGAG 

TTTCCCGAAA GTTCACCGAG 
TTTCCCGAAA GTTCACCGAG 
TTTCCCGAAA GTTCACCGAG 



GCCGCCAAGG AGGGCAAGAT CGAGATTTTC AGGCACATCC TGCAGCGGGA GTTTTCAGGA CTGAGCCACC TTTCCCGAAA GTTCACCGAG 
AAKE G K I EIP RHIL Q R E FSG LSHL SRK PTE 



4£3 f rt»b012c04s„ 
*H f rhob012c04s... 
4^frtiocl2c4n3.« 

jthLa029cl0t- 
<f§| Genbank W38- 
4§ Genbank AA35.. 

frhbbl2c4cl.-. 
4@ Genbank N24-. 



#411 
#411 
#403 
#157 
#155 
. #38 
>#1> 
>#!> 



#1351 



£rh6b012c04s_ #501 
f rtoob012c04s« #501 
i@frtiocl2c4h3. M . 8493 
4^1 jthLa029cl0t- #247 
Genbank W38- #245 
4^1 Genbank AA3-, #126 
^frhobl2c4cl.a-. #81 
4© Genbank N242-. #31 



TGGTGCTATG 
TGGTGCTATG 
TGGTGCTATG 
TGGTGCTATG 
TGGTGCTATG 
TGGTGCTATG 



GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTNCG 



GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCNCTG 



TATGACCTGG 
TATGACCT : G 
TATGACCTGG 
TATGACCTGG 
TATGACCTGG 
TATGACCTGG 
TATGACNTGG 



CTTCTGTGGA 
CTTCTGTGGA 
CTTCTGTGGA 
CTTCTGTGGA 
CTTCTGTGGA 
CTTCTNTGGA 
CTTCTGTGNA 



CAGCTGTGAG 
CAGCTGTGAG 
CAGCTGTGAG 
CAGCTGTGAG 
CAGCTGTGAG 
CAGCTGTNAG 
CAGCTGTGAN 



GAG A AC TC AG 
GAGAACTCAA 
GAGAACTCAG 
GAGAACTCAG 
GAGAACTCAG 
GAGAACTCAG 
GAGAACTCAG 
GAGAACTCAG 



TGCTGGAQAT 
TGCTGGAGAT 
TGCTGGAQAT 
TGCTGGAQAT 
TGCTGGAGAT 
TGCTGGAGAT 
TGCTGGAGAT 
TGCTGGAGAT 



CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGNCTTT 



TGGTGCTATG GGCCTGTCCG GGTGTCGCTG TATGACCTGG CTTCTGTGGA CAGCTGTGAG GAGAACTCAG TGCTGGAGAT CATTGCCTTT 
WCYG PVR VSL VDLA SVD SCE ENSV LEI 1AF 
+ + +•++ ++ • + 



CATTGCAAGA 
CATTGCAAGA 
CATTGCAAGA 
CATTGCAAGA 
CATTGCAAGA 
CATTGCAAGA 
CATTGCAAGA 
CATTGCAAGA 



OCCCGCACCG 
ACCCGCACCG 
GCCCGCACCG 
GCCCGNACCG 
GCCCGCACCG 
GCCCGCACCG 
GNCCGCACCG 
GCCCGNACCG 



ACACCGAATG 
ACACCGAATG 
ACACCGAATG 
ACACCGAATG 
ACACCGAATG 
ACACCGAATG 
ACACNOAATG 
ACACCGAATG 



GTCGNTTTGG 
GTCQNTTTGG 
GTCGTTTTGN 
GTCGTTTTGG 
GTCGTTTTGG 
GTCGTTTTGG 
GTCGTTTTGG 
GTCGTTTTGG 



AGCCCCTQAA 
AGCCCCTGAA 
AGCCCCTQAA 
AGCCCCTQAA 
AGCCCCTQAA 
AGCCCCTGAA 
AGCCCCTGAA 
AGCCCCTGAA 



CAAACTGCTG 
CAAACTGCTG 
CAAACTGCTG 
CAAACTGCTG 
CAAACTGCTG 
CAAACTGCTG 
CAAACTGCTG 
CAAACTGCTG 



CANGCGAAAT 
CAGGCGAAAT 
CANGCGAAAT 
CAGGCGAAAT 
CAGGCGAAAT 
CAGNCGAAAT 
CAGGCGAAAT 



GGGATCTGCT 
GGGATCTGCT 
GGGATCTGCT 
GGGATCTGCT 
GGGATCTGCT 
GGGATCTGCT 
GGGATCTGCT 



CAT : CCCAAG 
CATCCCCAAG 
CATNCNCAAG 
CATCCCCAAG 
CATCCCCAAG 
CATCCCCAAG 
CATCCCCAAG 



#1441 



CATTGCAAGA GCCCGCACCG ACACCGAATG GTCGTTTTGG AGCCCCTGAA CAAACTGCTG CAGGCGAAAT GGGATCTGCT CATCCCCAAG 
HCKS PHR H R M VVLE P L N KLL QAKW DLL I P K 
• ++++ + ++ • + 



21ella 

Sequeaxher" 1 vx 21ellracefinal" 



9 f rti6b012c04s„ #591 
:) f rhocl2c4h3 #583 
HjthLa029clOt.. #337 
j) Genbank W3B- #335 
D Genbank AA3.- #218 
I)frhdbl2c4cl.« #171 
D Genbank N24« #121 
iflhX21ellaaL.. >#1> 
Hfrhobl2c4el.-. >#1> 
hohb003T24ia« >#1> 
j johb003T24ia« >#1> 

#1531 



frhobl2c4cl.- #261 
Genbank N24-. #211 
^flhX21ellaal.„. #85 
4^frhobl2c4el.a.„ #85 
if% johb003T24ia(L. #43 
4p|johb003T24iaO H . #43 
£ lhX21ellev2~ >#1> 
4@ f lhX21ellea2- >#l> 
f lhX21elleal>. >#l> 
flhX21elldvl„. >#i> 
flhX2lell<Sal„ >#!> 

#1621 



frltobl2c4cl... #351 
4@ flhX21ellaaU. #175 
^£rbobl2c4el.. H #175 
4@ johb003T24ia™ #133 
40 johb003T24ia~ #133 
<J<)flhX21ellev2.... #89 
<f?)flhX21ellea2..„ #89 
4f§flhX21elleal..- #89 
4@flhX21elldvl. H #89 
4@eihX21ell<3al. M #89 



TTCTTCTTAA 
TTCTTCTTAA 
TTCTTNTTAA 
TTCTTCTTAA 
TTCTTCTTAA 
TTCTTCTTAA 
TTCTTCTTAA 
TTAA 
TTAA 



ACTTCCTG 
ACTTCCTG 
ACTTC 

ACTTCCTGTG 
ACTTCCTGTG 
ACTTCCTGTG 
ACTTCCTGTG 
ACTTCCTGTG 
ACTTCCTGTG 



TAATCTGATC 
TAATCTGATC 
TAATCTGATN 
TAATCTGATN 
TAATCTGATN 
TAATCTGATC 



TACATGTTCA TCTTCA 

TACATGTTCA TCTTCAACGC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA G 

TACATGTTCA TCTTCACCGC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GCAGGCCGCC 

TACATGTTCA TCTTCACCGC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GCAGGCCGCC 

TACATGTTCA TCTTCACCGC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GCAGGCCGCC 

TACATGTTCA TCTTCACCGC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GCAGGCCGCC 

GC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GinGCCGCC 

GC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GiiiGCCGCC 



TTCTTCTTAA ACTTCCTGTG TAATCTGATC TACATGTTCA TCTTCACCGC TGTTGCCTAC CATCAGCCTA CCCTGAAGAA GCAGGCCGCC 
PFLN PLC NLI Y M P I PTA V A Y HQPT LKK Q A A 
+ + ♦ • • * 



CCTCACCTGA 
CCTCACCTGA 
CCTCACCTGA 
CCTCACCTGA 
CCTCACCTGA 
CCTCACCTGA 
TCACCTGA 
TCACCTGA 
TCACCTGA 
TCACCTGA 
TCACCTGA 



AAGCGGAGGT 
AAGCGGAGGT 
AAGCGGAGGT 
AAGCGGAGGT 
AAGCGGAGGT 
AAGCGGAGGT 
AAGCGGAGGT 
AAGCGGAGGT 
AAGCGNAGGT 
AAGCGGAGGT 
AAGNGGAGGT 



TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 
TGGAAACTCC 



ATGCTGCTGA 
ATGCTGCTGA 
ATGCTGCTGA 
ATGCTGCTGA 
ATGCTGCTGA 
ATGCTGCTGA 
ATGCTGCTGA 
ATGCTGCTGA 
ANGCTGNTGA 
ATGCTGCTGA 
ATGCTGCTGA 



CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 
CGGGCCACAT 



CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 
CCTTATCCTG 



CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 
CTAGGGGGGA 



TNTACCTCCT 
TCTACCTCCT 
TNTACCTCCT 
TCTACCTCCT 
TCTACCTCCT 
TCTACCTCCT 
TCTACCTNCT 
TNTACCTCCT 
TNTACCTCCT 
TCTACCTCCT 
TNTACCTCCT 



CGTGGGCCAG 

CGTGGG 

CGTGGGCCAG 

CGTGGGCCAG 

CGTGGGCCAG 

CGTGGGCCAG 

CGTGGGCCAG 

CGTGGGCCAG 

CGTGGNCCAG 

CGTGGGCCAG 

CGTGGGCCAG 



CCTCACCTGA AAGCGGAGGT TGGAAACTCC ATGCTGCTGA CGGGCCACAT CCTTATCCTG CTAGGGGGGA TCTACCTCCT CGTGGGCCAG 
PHLK A E V GNS MLLT GHI LIL LGGI Y L L V G Q 
+ + + + + + + 



CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 
CTGTGGTACT 



TCTGGCGGCG 
TCTGGCGGCG 
TCTGGCGGCG 
TCTGGCGGCG 
TCTGGCGGCG 
TCTGGCGGNG 
TCTGGCGGCG 
TCTGGCGGCG 
TCTGGCGGCG 
TCTGGCGGCG 



CCACGTGTTC 
CCACGTGTTC 
CCACGTGTTC 
CCACGTGTTC 
CCACGTGTTC 
CCACNTGTTC 
CCACGTGTTC 
CCACGTGTTC 
CCACGTGTTC 
CCACGTGTTC 



ATCTGGATCT 
ATCTGGATNT 
ATCTGGATCT 
ATCTGGATCT 
ATCTGGATCT 
ATCTGGATCT 
ATCTGGATCT 
ATCTGGATCT 
ATCTGGATCT 
ATCTGGATCT 



CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 
CGTTCATAGA 



CAGCTACTTT 
CAGCTACTNN 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 



GAAATCCTCT 
GAAATCCTCT 
GAAATCCTCT 
GAAATCCTCT 
GAAATCCTCT 
GAAATCCTCN 
GAAATCCTCT 
GAAATCCTCT 
GAAATCCTCT 
GAAATCCTCT 



TCCTGTTCCA 
TCCTGTTCCA 
TCCTGTTCCA 
TCCTGTTCCA 
TCCTGTTCCA 
TCCTGTTNCA 
TCCTGTTCCA 
TCCTGTTCCA 
TCCTGTTCCA 
TCCTGTTCCA 



GGCCCTGCTC 
GGCCCTGCTC 
GGCCCTGCTC 
GGCCCTGCTC 
GGCCCTGCTC 
NGCCCTGCTC 
GGCCCTGCTC 
GGCCCTGCTC 
GGCCCTGCTC 
GGCCCTGCTC 



#1711 



CTGTGGTACT TCTGGCGGCG CCACGTGTTC ATCTGGATCT CGTTCATAGA CAGCTACTTT GAAATCCTCT TCCTGTTCCA GGCCCTGCTC 
LWYP WRR HVF IWIS PID SYP E I L P LPQ ALL 



21ella 

Sequencher™ "2 lellraoef inal " 



Achrb2ieli ~ #425 GGCACTAAGC cagatggcag cccggatgag cgctggtgct tcagggtgga ggaggtgaac tgggcttcat gggagcagac gctgcctacg 

4@frhbbl2c4dl.. H >#1> TCAT GGGAGCAGAC GCTGCCTACG 



#2431 GGCACTAAGC CAGATGGCAG CCCGGATGAG CGCTGGTGCT TCAGGGTGGA GGAGGTGAAC TGGGCTTCAT GGGAGCAGAC GCTGCCTACG 

GTKP DGS PDB RWCP RVE E V N WASW EQT LPT 

4@ AChrb21ell #515 CTGTGTGAGG ACCCGTCAGG GGCAGGTGTC CCTCGAACTC TCGAGAACCC TGTCCTGGCT TCCCCTCCCA AGGAGGATGA GGATGGTGCC 

4@frhobl2c4dl.a~. #25 CTGTGTGAGG ACCCGTCAGG GGCAGGTGTC CCTCGAACTC TCGAGNACCC TGTCCTGGCT TCCCCTCCCA AGGAGGATGA GGATGGTGCC 



#2521 CTGTGTGAGG ACCCGTCAGG GGCAGGTGTC CCTCGAACTC TCGAGAACCC TGTCCTGGCT TCCCCTCCCA AGGAGGATGA GGATGGTGCC 

LCED PSG A G V PRTO ENP V L A S P P K EDE D GA 

4^*1 Achrb2lGll #605 TCTGAGGAAA ACTATGTGCC CGTCCAGCTC CTCCAGTCCA ACTGATGGCC CAGATGCAGC AGGAGGCCAG aggacagagc AGAGGATCTT 

4@frhObl2c4dl.« #115 TCTGAGGAAA ACTATGTGCC CGTCCAGCTC CTCCAGTCCA ACTGATGGCC CAGATGCAGC AGGAGGCCAG AGGACAGAGC ANAGGATCTT 



#2611 TCTGAGGAAA ACTATGTGCC CGTCCAGCTC CTCCAGTCCA ACTGATGGCC CAGATGCAGC AGGAGGCCAG AGGACAGAGC AGAGGATCTT 
SEEN YVP VQL LQSN .WP RCS RRPB DRA EDL 

+ 

<fH Achrb21ell « #695 TCCAACCACA TCTGCTGGCT CTGGGGTCCC AGTGAATTCT GGTGGCAAAT ATATATTTTC ACTAACTCAA AAAAAAAAAA AAAAA 
4@frti6bl2c4dl.«. #205 TCCAACCACA TCTGCTGGCT CTGGGGTCCC AGTGAATTCT GGTGGCAAAT ATATATTTTC ACTAACTCAA AAAAAAAAAA AAAAAAAAAA 



#2701 TCCAACCACA TCTGCTGGCT CTGGGGTCCC AGTGAATTCT GGTGGCAAAT ATATATTTTC ACTAACTCAA AAAAAAAAAA AAAAAAAAAA 
SNHI CWL WGP SEPW WQ1 YIP TNSK KKK KKK 



4@frllobl2c4dl. B . #295 AAAAAAAAAA AAA AAAAA A 



#2791 



AAAAAAAAAA AAAAAAAAA 
K K K K K K 
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